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Abstract: Mirids are the primary pests affecting cocoa production in Africa, but no genetic 
studies have been conducted on these insects. Here we report the isolation and 
characterization of 12 polymorphic microsatellite loci for Sahlbergella singularis. 
A microsatellite-enriched genomic DNA library was developed and screened to identify 
marker loci. Twelve polymorphic loci were identified by screening 28 individuals collected 
from one presumed population in cocoa plantations in Southern Cameroon. The number of 
alleles ranged from 5 to 25, whereas the observed and the expected heterozygosities ranged 
from 0.179 to 0.786 and from 0.671 to 0.946, respectively. Tests showed significant 
deviations from HW equilibrium for four loci, but no linkage disequilibrium was detected 
at any of the loci. No cross-species amplification was observed in two other mirid pests 
in Africa. 
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1. Introduction 

The mirid Sahlbergella singularis Hagl. (Hemiptera: Miridae: Bryocorinae) is one of the primary 
pests affecting cocoa (Theobroma cacao L.) production in Africa, associated with 25 to 40% 
production losses. Sahlbergella singularis is widely distributed in West Africa, present throughout the 
forest zone, from Sierra Leone to the Demographic Republic of Congo [1], and its life history is well 
known on cocoa [1,2]. However, knowledge of S. singularis population structure in cocoa plantations 
is incomplete. About one century ago, mirids adapted to cocoa, a newly introduced cash-crop in 
West Africa. Its natural host-plants are mainly forest trees of the Malvaceae [1]. Although these trees 
are frequent neighbors or shade trees in cocoa agroforestry systems [3], their role in mirid population 
dynamics is unknown. 

Molecular genetic techniques provide powerful tools for the study of insect ecology and population 
genetics in natural environment. DNA fingerprinting using micro satellite loci provides information on 
genetic variation at the population level allowing identification of biotypes and characterization of 
population structure, gene flow, and dispersal [4]. The present paper reports the isolation and 
characterization of micro satellite markers for Sahlbergella singularis. We also tested cross-species 
amplification for the closely related species Distantiella theobroma Distant. (Hemiptera: Miridae), 
the second most important pest of cocoa in West Africa [1], and for a mirid bug of sorghum, 
Eury stylus oldi Poppius (Hemiptera: Miridae). 

2. Results and Discussion 

Twelve primer pairs out of 28 tested showed a good amplification pattern and polymorphism in the 
28 S. singularis individuals sampled from one presumed population for the study. The number of 
alleles per locus ranged from 5 to 25, while the observed and the expected heterozygosities ranged 
from 0.179 to 0.786 and from 0.671 to 0.946, respectively (Table 1). LD tests showed that there was 
no significant genetic association for all the 66 pairwise combinations of loci. The test showed 
significant deviations from HW equilibrium (P < 0.05) for four loci (Ssl4, SsOl, Ss4 and SslO). 
Analysis performed with MICRO-CHECKER showed that the general deficit in heterozygotes was 
distributed across most allele size classes excluding genotyping errors as a source of deviation from 
HW equilibrium. Our results showed that a significant excess of homozygotes for loci Ssl4 and SslO 
might be due to high prevalence of null alleles (i.e., >20%; Table 1). Other loci, including SsOl, 
showed null allele frequencies <10% but Ss4 displayed a moderate frequency of 11%. The primary 
cause of null alleles is mutations in the primer-binding region flanking micro satellite sequences [5,6]. 
Null alleles are common in insects and similar studies for other mirid species also indicated signs of 
null alleles at several loci [7-9]. 
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Table 1. Twelve microsatellite loci isolated in Sahlbergella singularis collected from Southern Cameroon: locus name, PCR multiplex set no., 
repeat motif and allele size for the cloned allele, allele size range, number of alleles (N- d ), observed (H 0 ) and expected (H e ) heterozygosities, 
mean null allele frequency (r nu n), primer sequences and GenBank Accession no. 





PCR 
multiplex set 


Reneat motif 


Allele 

size (bp) 


Allele size 
range (bp) 




H 


H 


r null 

(%) 


Primer spfliienoc* fS'-'VI 


Genbank 
Accession N° 


Ss14 


I 


(GA) 30 CA(GA) 2 
AA(GA) 3 


734 


905-278 


16 

1 \J 


0 467 


0 91 7 * 


73 


F: Pef-CTGGAAATGGGTAGGGGATT 
R: GACAGGGTAGTCGGCAAGAC 


IO687707 


Ss24 


I 




188 


153-247 


17 


0.720 


0.904 


9 


F: AW-AAACACGACTTTTCCCTTAC 
R: AGCTAAAATGCTATCTCTGC 


IO687706 


Ss01 


2 




739 


71 fi_7S7 


1 8 

1 O 


0 750 


0 977 * 


7 


F: fam-TCCGAGGGAAACCTTCCTAT 
R: ACGTTATGCAGCACCGATTA 


JQ687216 


Ss1 1 


2 






1 1 Q_1 S1 

117 1J1 


12 


0 750 


0 871 

\J.O 1 1 


4 


F: Fam-GTCCATGCGAGCTGATGTT 
R: CGTCTCTCCTGCTTCATACG 


JD68771 3 


Ss1 5 


2 




161 


17S-194 

1 LJ 1 yT - 


25 


0 741 

\J. 1 "T 1 


0 946 


10 

1 V/ 


F: Pef-CGAAGCCAAGCGTATATTCC 
R: TGCGAGGTCGATAGTTTGAA 


IO687708 


Ss04 


3 


(CT) 6 AT(CT) V 
(CA) 3 CG(CA) 6 


217 


187-234 


17 


0.643 


0.851 * 


11 


F: Fam-GGATGTTCCTTTACCGCTTT 
R: ACATGAATAGCGTGAGATTCC 


IO687710 


Ss05 


3 


(GT),, 


120 


104-125 


9 


0.714 


0.872 


8 


F: Fam-CTAGTGATGGTATGTAATCAGC 
R: GTGAACTCTACAAGGGATAATG 


10687217 


Ss17 


3 




1 98 

1 70 


lOJ ZsZ^y 


1 0 

1 \J 


0 609 

V./ . V V./ v 


0 788 

\J. 1 OO 


1 0 

1 \J 


F: V/c-ACAACCAAGCTGATGTTTCG 
R: TCATTCATTACAGTGCCTCTTG 


IH687714 


Ss06 


4 


(CA) 6 


100 


96-102 


5 


0.536 


0.671 


4 


F: V/c-TATAGGGCCAGGGGTAGACA 
R: AAAGGGCTGTAATCGAAATGC 


JQ687215 


Ssl9 


4 


GACGAG(GA) 17 
GG(GA) 2 


160 


130-183 


14 


0.786 


0.865 


4 


F: V/c-CAGCAATGTCTTAATGTTCGAC 
R: TTGAAGCAGTGGCTCTTAATG 


JQ687211 


SslO 


na 


(GT) 15 (GTGA) 3 
(GAAT) 3 


101 


76-127 


8 


0.179 


0.706 * 


31 


F: fam-GCTGGGTATTTGAGAGGGATT 
R: CGCCAGATGAATAATAAAGACG 


JQ687212 


Ss25 


na 


(AC) 27 


231 


191-231 


13 


0.778 


0.861 


4 


F:famCGTTATCAGTATCATTCGAGCAGT 
R: GTTAGTCCTCGCCGCATCT 


JQ687209 



* indicates significant deviations from Hardy-Weinberg equilibrium after sequential Bonferroni correction {p < 0.05). 
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None of the S. singularis microsatellite primers used in this study amplified in either of the other 
two mirid species, which suggests that the primers designed for S. singularis in our study may be 
species-specific, with limited cross-species applicability. 

3. Experimental Section 

3.1. Isolation of Microsatellite Markers 

Genomic DNA was extracted based on Risterucci et al. [10] from 5 pooled individuals (3 adults and 
2 nymphs) collected on a cocoa farm in the Centre Region of Cameroon near Yaounde (3°52'N; 
11°28'E), and conserved in alcohol. A microsatellite-enriched genomic DNA library was developed 
following the method of Billotte et al. [11]: total genomic DNA was digested with the Rsal. DNA 
fragments were purified and ligated with T4 DNA ligase (Gibco-BRL) to Mlul self-complementary 
adaptators (RSA21 5'-CTCTTGCTTACGCGTGGACTA-3' and phosphorylated RSA25 
5'-TAGTCCACGCGTAAGCAAGAGCACA-3'). The selection of microsatellite sequences was 
performed following a hybridization-based capture method using biotin-labeled microsatellite 
oligoprobes and streptavidin-coated magnetic beads. The selected fragments were amplified using 
RSA21 primer. The resulting amplification products were cloned into pGEM-T Easy vector (Promega, 
Madison, USA), and transformed using Epicurian coli XLl-Blue MRF super-competent cells 
(Stratagene, www.genomics.agilent.com). One hundred ninety-two recombinant colonies were 
amplified with RSA21 primer. The size of inserts was estimated using agarose gel electrophoresis of 
PCR products. The electrophoresed PCR products were alkaline-Southern transferred onto Hybond N+ 
nylon membranes (Amersham, www.gelifesciences.com). Clones containing microsatellite alleles 
were selected by hybridization with a P radiolabeled (GA)is and (GT)i5 synthetic microsatellite probe. 
We sequenced the 93 recombinant clones that showed a strong hybridization signal and the SSR 
Analysis Tool (SAT) [12] was used to collect sequence information and facilitate the design of PCR 
primers and evaluation of flanking sequences. The following criteria were considered for sequence 
selection: uniqueness, adequate flanking sequence size, and a lack of repetitive elements in the 
flanking regions. Final primer pairs were designed using Primer 3 [13]. 

3.2. Primer Validation 

Levels of locus polymorphism were assessed in 28 S. singularis individuals, sampled from a 
presumed population on cocoa, in a restricted area of the Bakoa village (4°34.3'N; 11°10.0E) in the 
Centre Region of Cameroon. Since evidence was produced that individuals on a cocoa tree may be 
related [2], we collected a single individual per tree. PCR amplification was performed with a 
thermocycler TC-512 (Techne) following a Touchdown procedure [14] with an annealing temperature 
decrease of 0.5 °C per cycle during the first 10 cycles of the PCR (from 60 °C to 55 °C). PCR started 
with an initial activation step at 95 °C for 15 min followed by 35 cycles with denaturation at 94 °C for 
30 s, annealing for 90 s, extension at 72 °C for 75 s and a 20 min final extension step at 72 °C. 

Loci were combined in sets which were independently co-amplified using a multilocus 
amplification Kit (Qiagen) in a 10 uL volume containing lx Qiagen Multiplex Master Mix (+Q), 0.2 uM 
of each primer and 2 uL of genomic DNA (20 ng/uL). Forward primers were labeled with FAM, VIC, 
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or NED fluorescent labels. Labeled fragments were then discriminated using a capillary sequencer ABI 
3500 (Applied Biosystems) with the size standard GeneScan 500 Liz. Allele sizes were determined 
using GENEMAPPER version 4.1 (Applied Biosystems). 

Levels of expected and observed heterozygosities were computed using GENECLASS2 version 
2.0.h [15]. Hardy- Weinberg (HW) tests for each locus and linkage disequilibrium (LD) tests for each 
pair of loci were performed using GENEPOP version 4.1 [16]. P-values of HW and LD tests were 
corrected with the sequential Bonferroni adjustment [17]. Mean null allele frequencies were computed 
using FREENA [6]. The most probable causes of deviation from HW equilibrium were determined 
among various genotyping errors and the presence of null alleles with MICRO-CHECKER 
version 2.2.3 [18]. 

3.3. Cross-Species Transferability 

Cross-species amplification was attempted for 7 individuals of Distantiella theobroma, collected on 
Bombax sp. (Malvaceae) near Bokito (4°30.0'N; 11°04.8E), and 6 individuals of Eurystylus oldi, 
collected on sorghum at Niamey, Niger (13°30.8'N; 2°06.7E). Extraction and PCR amplification were 
performed as described for S. singularis. 

4. Conclusions 

To the best of our knowledge, no microsatellite markers have been developed for African cocoa 
mirids. A recent publication reports the description of 6 polymorphic loci for the polyphagous mirid 
Bryocorinae Helopeltis theivora Waterhous, a pest of cocoa in Asia [19]. Here we described twelve 
microsatellite loci for potential use in genetics studies of Sahlbergella singularis populations. 
Our work will improve the management of this important pest that affects cocoa production in Africa 
through better knowledge of mirid ecology in cocoa plantations. Conservation of primer sequences 
was not observed for the two other mirid species tested. 
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